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Abstract 

Background: Mutation of amino acid sequences in a protein may have diverse effects on its structure and function. 
Point mutations of even a single amino acid residue in the helices of the non-redundant database may lead to 
sequentially identical peptides which adopt different secondary structures in different proteins. However, various 
physico-chemical factors which govern the formation of these ambivalent helices generated by point mutations of a 
sequence are not clearly known. 

Results: Sequences generated by point mutations of helices are mapped on to their non-helical counterparts in the 
SCOP database. The results show that short helices are prone to transform into non-helical conformations upon point 
mutations. Mutation of amino acid residues by helix breakers preferentially yield non-helical conformations, while 
mutation with residues of intermediate helix propensity display least preferences for non-helical conformations. 
Differences in the solvent accessibility of the mutating/mutated residues are found to be a major criteria for these 
sequences to conform to non-helical conformations. Even with minimal differences in the amino acid distributions of 
the sequences flanking the helical and non-helical conformations, helix-flanking sequences are found be more 
solvent accessible. 

Conclusions: All types of mutations from helical to non-helical conformations are investigated. The primary factors 
attributing such changes in conformation can be: i) type/propensity of the mutating and mutant residues ii) solvent 
accessibility of the residue at the mutation site iii) context/environment dependence of the flanking sequences. The 
results from the present study may be used to design de novo proteins via point mutations. 



Background 

Under physiological conditions, folding of proteins to 
well-defined three dimensional structures is crucial for 
executing their specific biological functions. To the extent 
that the folding pattern of proteins dictates function, 
modifying the structure may entail in either altering 
or disrupting its function. Point mutations in a pro- 
tein sequence, which are introduced by single amino 
acid residue replacements by site-directed mutagenesis, 
may yield different phenotypes by changing the native 
structure. The effect of point mutations are important 
in exploring various functional and structural features 
viz. protein sequenceaA§structure relationships [1], engi- 
neering protein stability [2,3], predicting the evolutionary 
dynamics of proteins [4-7], validating and refining vari- 
ous protein models and simulations [8-10], and designing 
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de novo proteins [11]. Mutations may also influence the 
local structure of a protein by changing the secondary 
structures of peptide sequences from one conformation to 
another [12,13]. It is established that the inherited prion 
diseases of humans are all linked to specific point and 
insertion mutations, which are responsible for the a -> 
transition leading to aggregation due to the formation of 
amyloid fibrils [14]. Similar conclusions are drawn from 
the theoretical studies of prion protein [15]. 

Experimental studies of mutating different positions in 
a protein structure with various amino acid residues are 
time-consuming and expensive process. Such an inves- 
tigation is facilitated by the three-dimensional modeling 
of polypeptide chain mutations. Experimental determi- 
nations of secondary structure propensity and stability 
[16-19] are usually studied through point mutations. The 
results obtained from the mutation of helices are used for 
rationalizing helix-coil transitions and compared to that 
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of Lifson-Roig [20] and Zimm-Bragg [21] models. How- 
ever, the knowledge of the type and position of mutations 
required for the transformation of helical —> non-helical 
conformation or vice versa is largely unclear. The compu- 
tational demands of such problems limit the number of 
mutations and size of sequences that may be considered. 
Alternatively, crystallographic structures from the Protein 
Data Bank [22] may be used as templates to explore point 
mutations leading to helical ambivalency. 

The present work aims at investigating the physico- 
chemical properties of ambivalent helices generated by 
point mutations. Ambivalent sequences conform to dif- 
ferent secondary structures in two different proteins. 
Kabsch and Sander [23] first reported the occurrence and 
physico-chemical properties of these sequences which 
were studied in great detail in the subsequent works 
[24-30]. Importance of these sequences lie in their impli- 
cation in the pathogenesis of amyloid diseases includ- 
ing Alzheimer's disease, transmissible bovine spongiform 
encephalopathy etc [31,32]. These sequences also chal- 
lenge the existing protein structure prediction methods 
based on sequence homology [33]. Recently we have 
shown that variable (ambivalent) helices, sequences which 
are in helical conformations in one protein and non- 
helical in other, have different physico-chemical, context 
and dynamical properties in comparison to conserved 
helices i.e. sequences which remain in helical conforma- 
tion throughout [34-36]. Although Argos [24] reported 
ambivalent sequences which differ by a single amino acid, 
no subsequent studies on the occurrence and proper- 
ties of these sequences were performed. However, these 
type of sequences are important as point mutations may 
cause secondary structure transformation, which may 
have significant implications in various misfolding dis- 
eases. Recent studies demonstrate that a conformational 
switch between two proteins with completely different 
structures and functions occurs via a single amino acid 
mutation [37]. 

This switching between the folds can occur in mul- 
tiple ways via successive single amino acid mutation at 
different positions [38]. Both theoretical [39] and compu- 
tational [40] studies have highlighted the bifunctionality 
of proteins via single point mutation. 

This work presents a detailed study of helical ambiva- 
lency induced by point mutations. Helices from the 
non-redundant database are point mutated at all residue 
positions and the resulting sequences are mapped onto 
the SCOP database to obtain completely non-helical con- 
formations. Previous analysis of physico-chemical prop- 
erties of ambivalent sequences were restricted to the 
local structure neglecting the global influence of the 
other parts of the protein [25,27-30,34]. Along simi- 
lar lines, the present study also analyzes the physico- 
chemical properties of ambivalent sequences induced by 



point mutations disregarding the effect on the remain- 
ing parts of the protein. The results show that smaller 
helices readily transform into non-helical conformations 
after point mutations. Sequences obtained by point muta- 
tions with helix-breaking residues usually correspond to 
non-helical conformations in the SCOP database. How- 
ever, point mutations of helix indifferent residues yield 
mutated sequences which has a low probability of con- 
forming to non-helical conformations. Differences in the 
solvent accessibility of the mutating/mutated residues in 
helical and non-helical conformations is found to be a 
major factor for the mutated sequence conforming to non- 
helical structures. Most of the mutated residues display 
different degree of solvent accessibility in helical and non- 
helical conformations. The non-helical conformations are 
found to be less solvent-accessible as compared to the 
original helices. Despite marginal differences in the amino 
acid distributions of the sequences flanking helical and 
non-helical conformations, the solvent accessibility of the 
sequences flanking helices are greater than that of the 
corresponding non-helical conformations. 

Methods 

Database and mutation 

The database used in the present study comprises the 
crystal structures from May-2008 release of PDB-select 
[41], which were compiled to create a database of non- 
redundant proteins from PDB [22] (Protein Data Bank). 
The database comprises protein chains with a sequence 
identity of 25% or less. All protein chains considered in 
this study have resolution < 3A and crystallographic R- 
factor, R < 0.3. The selected database consists of 2586 
non-redundant protein chains from 2466 protein struc- 
tures. Secondary structures are annotated residue-wise 
with the help of DSSP software [42]. According to the 
widely used definition, H and G denote helical conforma- 
tions while all other classes (B, E, I, S, T, -) are considered 
to be non-helical [43-45]. Neglecting helices with less 
than 5 residues long, there are 11592 helices in the non- 
redundant database. These helices were point mutated at 
each position by all 20 amino acids (excluding the amino 
acid present at the given position in wild type helix). So 
for a helix of length 5 amino acids this method will gen- 
erate (5X19) 95 mutated sequences. The total number of 
mutated helices generated from 11592 helices is 2662375 
which constitute the database of mutated sequences. 

Proteins from nine SCOP (Structural Classification 
of Proteins) [46] classes viz. (I)A11 alpha proteins, 
(II)All beta proteins, (III)Alpha and beta proteins(a+b), 
(IV)Alpha and beta proteins(a/b), (V)Coiled coiled pro- 
teins, (VI) Membrane and cell surface proteins and 
peptides, (VII)Multi-domain proteins(alpha and beta), 
(VHI)Peptides and (IX)Small proteins were compiled to 
obtain sequences identical to the mutated sequences. A 
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structural cut-off resolution < 3A and R < 0.3 were 
applied on these proteins with the PISCES server [47]. The 
resultant SCOP database consists of 48244 protein chains 
from 22309 protein structures. 

Both the non-redundant protein and SCOP database 
used in the present study are similar to our recent work 
on ambivalent helices [34] which helps in comparison 
between both the works. 

Identical sequence search 

The method for identifying the sequences identical to 
the mutated sequences in SCOP database in non-helical 
conformations is similar to previous method of search- 
ing variable and conserved helices [34] . However, a slight 
modification of the method was performed due to the 
large number of mutation generated sequences as com- 
pared to the original wild type helices. The method is 
outlined as follows: for a mutated sequence of N residues 
to be mapped onto a protein in the SCOP database of 
M residues, an NXM matrix is created where an ele- 
ment of the matrix, A(i,j)[i = 1 N,j = 1 M], 
is equal to 1 if i th position of the helix and f h posi- 
tion of the protein chain have identical residues and the 
conformation of the residue in j th position of the pro- 
tein in SCOP database is non-helical. Now if an element 

N-l 

A(k,l)[kei,kj] = 1 and £ A 0< + m,l + m) = AT, 

m=0 

where m is a running index, then the helix from non- 
redundant database is mapped from / to /+ N — 1 position 
of the SCOP database protein with non-helical confor- 
mations. Sequences identical to the mutated sequences 
with non-helical conformations are only identified in the 
present study. This method yields sequences identical 
to the mutated sequences with completely non-helical 
conformations only. Among 2662375 mutated sequences, 
28957 were mapped in 253728 sequences in the SCOP 
database with completely non-helical conformations. The 
list of 28957 mutation generated helical sequences are 
provided in Additional file 1: Table SI of supplemen- 
tary material. These 28957 mutated sequences are gen- 
erated from 1775 non-redundant helices through point 
mutations. These 1775 helices are denoted as wild type 
helices, 28957 mutated sequences as ambivalent mutated 
sequences and 253728 mapped sequences in the SCOP 
database as non-helical sequences respectively. 

Results and discussion 

Number distribution of mutation generated ambivalent 
helices 

The number of ambivalent mutated sequences (28957) 
which conform to non-helical structures in the SCOP 
database is plotted as a function of sequence length in 
Figure 1. From Figure 1 it is observed that the shorter 
mutated helical sequences (< 6) are more probable 



to conform to non-helical conformations in the SCOP 
database. However, a small number of sequences (~ 6%) 
of length > 7 are found to conform to non-helical 
conformations in the SCOP database. This further dips 
to ~ 2% when mapped 253728 non-helical sequences in 
SCOP are considered. For long helices, hydrogen bonds 
render stability to the overall structure which is not dis- 
rupted by the point mutations. This explains why few long 
helices generated by point mutations switch to non-helical 
conformations. 

The inset of Figure 1 highlights the region of sequence 
length > 7. An interesting phenomenon may be observed 
here. The number of mutated sequences with length in 
multiples of 3 (i.e. 9, 12,...) either shows a peak or a trough, 
which is similar to the trend observed in helices [48]. A 
larger number of these sequences imply a greater proba- 
bility of switching into non-helical conformations subject 
to point mutations. 

Propensity of conformation change with different 
mutations 

Table 1 depicts the number of times an ambivalent 
mutated sequence is mapped into non-helical conforma- 
tions in the SCOP database when a particular amino acid 
(first column) is mutated by 19 other amino acids (first 
row). A glance at the table shows that the number of 
non-helical conformations increases either by mutating 
an amino acid with helix breaking residues (e.g. columns 
starting with G, P) or substituting for a helix forming 
amino acid (e.g. row starting with L). 

It is computationally infeasible to generate sequences 
identical to the mutated helical sequences. However, it 
is possible to obtain the number of mutated sequences 
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Figure 1 Occurrence frequency vs. mutated sequence length. 

Frequency of occurrence of mutation generated sequences 
conforming into non-helical structure in the SCOP database (28957 in 
number) as a function of sequence length. Inset depicts a magnified 
view of sequence length between 8 - 20. 



Table 1 Number of times a helical sequence gets mapped into non-helical conformation in SCOP database when a particular amino acid (first column) is mutated 
by 19 other amino acids (first row) 
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Diagonal elements are zero as these mutations will lead to wild type helices. 
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which retain their helical conformations in the SCOP 
database. To estimate the number of different type of 
mutations, which retains the helical conformations, the 
probability of occurence of 20 amino acids in conserved 
helices is used from our earlier study [34]. The number 
of X^ Y mutations leading to the retention of helical 
sequences is obtained by multiplying the total number of 
mutations to generate mutated sequences in the database 
with the fraction of occurence of amino acids, Y, in the 
conserved helices. Table 2 depicts the estimated number 
of each type of amino acid mutations which preserve heli- 
cal conformations. The results show that a large number 
of sequences mutated by helix forming residues like Ala, 
Val etc. retain their helical conformation when mapped 
into SCOP database. 

For simplification we divide the 20 amino acids into 
three groups based on their helix forming propensity: F for 
helix forming amino acids which include A, E, F, H, L, M, 
Q, V and W, I for helix indifferent amino acids viz. C, D, I, 
K, R, S and T and B for helix breaking amino acids like G, 
N, P and Y [49,50]. This division is primarily based upon 
the statistical analysis values rather than experimental 
scales like thermodynamic stability [51]. The number of 
different types of mutation is reduced from 380(20X19) to 
3X3 = 9 viz. F^F, F— >T, F^B,...etc. The propensity of a 
particular mutation X^ Y, which transforms the sequence 
from helical to non-helical conformation, is defined as 

P 

Propensity = - (1) 
<l 

where p is the probability of occurrence of X^ Y mutation 
leading to 28957 ambivalent mutated sequences, while 
q denotes the probability of the same mutation in the 
database of mutated sequences. A propensity value > 1 
denotes a preference for the specific mutation to conform 
to the non-helical structures while < 1 implies that the 
mutation may not preferably yield the same. 

Figure 2 shows the propensity of a particular muta- 
tion leading to non-helical conformations with 9 different 
types of mutations. Propensity of mutations with helix- 
breaking residues have a higher probability to form non- 
helical structures in the SCOP database. 

However, propensity of mutation of helix indifferent 
residues shows that it is not favored for conforming into 
non-helical structures. 

Earlier results depict that amino acids have different 
preferences for the two termini of helices [48,52-57]. How- 
ever, the present work did not find any perceptible differ- 
ence in the propensities of different types of mutations at 
N- and C-terminus of the helical sequences. 

Interestingly propensity for B^F, B^I and B^B 
are found to be greater than 1 in Figure 2. While 
higher propensities for B^B are expected, a simi- 
lar trend for B^I and B^F may be explained by a 



marked increase in the solvent accessibility, which out- 
weighs the decrease in propensity yielding predomi- 
nantly non-helical conformations. Previous studies have 
shown that amino acid propensities are position specific 
with respect to both termini displaying an oscillatory 
behavior [48]. 

Moreover, helices are found to be ampiphilic in nature 
with respect to solvent accessibility and hydrophobicity 
[48,58-60]. 

Hence, mutation of helix breaking residues with helix- 
forming and helix-indifferent residues are position- 
specific, which may account for the behavior observed in 
Figure 2. 

Mutation generated sequences leading to non-helical 
conformations show change in solvent accessibility of the 
mutated site 

The solvent accessibility of each residue is calculated 
using DSSP [42]. The normalized solvent accessibil- 
ity is calculated as the ratio of this absolute value 
to the maximum solvent accessibility of amino acid 
residues found in Gly-X-Gly [61]. In this study, the 
residues are classified as buried or exposed according 
to their different degree of solvent accessibility. Buried 
residues are < 7% solvent accessible, while the exposed 
surface residues have > 37% relative solvent acces- 
sibility. Residues in the intermediate zone have rela- 
tive solvent accessibility in the range between 7% and 
37% [62]. 

Propensity of a X^Y mutation for different solvent 
accessible (buried/ intermediate/ exposed) residues of 
type X in the wild type helices is defined by equation 1. 
For a given solvent accessible residue of type X, p is 
the probability of occurrence of X^Y mutation in the 
ambivalent mutated sequences, while q is the proba- 
bility of the same mutation for a residue of the same 
solvent accessibility in database of mutated sequences. 
Figure 3 depicts the propensity of a given type of muta- 
tion for residues of different solvent accessibility in the 
helices of the non-redundant database, corresponding to 
non-helical conformations in the SCOP database. The 
results show that mutations of helix-forming and helix- 
indifferent residues at solvent accessible sites mostly 
yields non-helical conformations followed by mutation 
of residues having intermediate solvent accessibility, 
while an opposite trend is observed for mutations of 
helix-breaking residues. 

Mapping the entire dataset of all mutation generated 
sequences (2662375) in the SCOP database demands 
enormous computational time and resources and hence is 
out of the scope of the present work. Hence the propensity 
of mutation X^Y for residues of different solvent acces- 
sibility (buried/ intermediate/ exposed) of Y in 253728 
non-helical sequences of the SCOP database may be 



Table 2 Estimated number of times a helical sequence gets mapped into another helical conformation in SCOP database when a particular amino acid 
(first column) is mutated by 19 other amino acids (first row) 
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112 


309 


341 
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93 


260 


299 


233 


219 


316 


79 


0 



Diagonal elements are zero as these mutations will lead to wild type helices. 
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Figure 2 Propensity of different mutations. Propensity of 9 different types of mutations generating a helical conformation from the 
non-redundant database which map into non-helical conformation in the SCOP database. As depicted in Equation 1 , propensity here is the ratio of 
probability of a given type of mutation leading to ambivalent mutated sequences to the probability of the same type of mutation in the total 
database of mutated sequences. 



approximately calculated as the ratio of probabilities given 
by 

P 

Propensity = (2) 

qi x q 2 

where p is the probability of X^Y mutation in the 
non-helical sequences (253728) with a particular type of 
solvent accessibility of Y and q\ and q 2 are the prob- 
abilities of residue X and Y of same/different solvent 
accessibility in the non-redundant database and the SCOP 
database respectively. The denominator is normalized to 
unity for the different types of mutations. Figure 4 show 
the propensity of various types of mutations leading to 
different solvent accessibility of the mutated position. 
Excluding mutation of the helix forming residues (F) a 



completely opposite trend is observed in Figure 4 com- 
pared to that of Figure 3. This shows that mutating the 
helix-breaking and helix-indifferent residues the solvent 
accessibility changes, which makes the sequence non- 
helical. 

The above results motivated us to investigate the over- 
all solvent accessibility of the wild type helices, yielding 
the ambivalent mutated sequences which corresponds to 
the non-helical sequences in the SCOP database. Figure 5 
shows the fraction of wild type helices and their corre- 
sponding non-helical sequences in the SCOP database at 
different values of the average normalized solvent acces- 
sibility. The results show that mutated sequences in the 
non-helical conformations have lower solvent accessibil- 
ity i.e. they are buried in the interior of the protein 
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Figure 3 Propensity of mutated Residue with different solvent accessibility. Propensity of 9 different types of mutations at different solvent 
accessible positions (buried, intermediate and exposed) generating a helical conformation from the non-redundant database which map into 
non-helical conformation in the SCOP database. The propensity here depicts the ratio of probability of a given type of mutation at a given solvent 
accessibility leading to ambivalent mutated sequence to the probability of the same type of mutation at the similar solvent accessibility in the total 
database of mutated sequences. 
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Mutation 

Figure 4 Propensity of resultant residue with different solvent accessibility. Propensity of 9 different types of mutations resulting in different 
solvent accessible (buried, intermediate and exposed) residues generating a helical conformation from the non-redundant database which map 
into non-helical conformation in the SCOP database. The propensity depicted here is similar to that shown in Figure 3. However, while calculating 
this propensity solvent accessibility of the resultant mutant residue is considered rather than solvent accessibility of the mutated residue. 



compared to the parent helices. The data displayed in 
Figure 5 are validated through the t-test and the dif- 
ference is found to be highly significant with P-value 
P < 0.001 and T-value T = 25.3569. For more than 
65% cases, the non-helical conformations are found to be 
less solvent accessible as compared to the original helical 
conformations. 

Flanking sequences have vast differences in solvent 
accessibility though amino acid distributions are similar in 
them 

Previous studies have established that context plays a 
major role in determining the structure of an ambiva- 
lent sequences [28,30,34]. It was suggested earlier that 
the flanking amino acids of ambivalent sequences play 
an important role in determining the conformations of 
these sequences. This finding is also validated experi- 
mentally [63]. An analysis of the amino acid occurrence 



frequency in the flanking sequences of the wild type 
helices and those of the corresponding non-helical con- 
formations are also presented in this work. The flank- 
ing sequences are differentiated as N-terminus flanking 
sequences (four residues preceding the sequences) and 
C-terminus flanking sequences (four residues succeed- 
ing the sequences). Figure 6 shows the distribution of 20 
amino acids in these flanking sequences. The distribu- 
tion is depicted as the conformational parameter which is 
defined by, 

CP = — (3) 
Pi 

where Pg is the probability of i th residue in j th flanking 
sequence while Pi is the probability of i th residue in the 
non-redundant database. 

Conformational parameter > 1 depicts that the amino 
acid is preferred in that sequence while a value < 1 shows 
it is not preferred. 

Minimal differences are noticed for the flanking 
sequences of the wild type helices and their corre- 
sponding non-helical counterparts. This minimal dif- 
ferences in conformational parameters are in sharp 
contrast to those observed in chameleon/ambivalent 
sequences earlier [25,28,30,34,64]. In previous studies 
of chameleon/ambivalent sequences, significant differ- 
ences in the amino acid conformational parameters were 
observed in sequences flanking helical and non-helical 
conformations. In the present study significant differences 
are found only for the amino acids C, D, H and S at the 
Af-terminus and amino acids K, L, Q, T and V for the 
C-terminus flanking sequences. Since Cysteines tend to 
form disulfide bonds, their high occurrence in the flanking 
sequences of the wild type helices imparts extra stability 
to these structures. 




Average normalized solvent accessibility (%) 



Figure 5 Fraction of sequences vs. average solvent accessibility. 

Fraction of helices and their corresponding mutation generated 
sequences in non-helical conformations with different average 
normalized solvent accessibility. The solvent accessibility of each 
residue was calculated using DSSP and normalized as depicted in the 
text. 
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Figure 6 Conformational parameter of amino acids in flanking sequences. Conformational parameter of the amino acids in flanking sequences 
of (A) N-terminus and (B) C-terminus of helices and their mutated sequences in the non-helical conformations. Conformational parameter here 
refers to the ration of probability of an amino acid in the flanking sequences to probability of the same amino acid in non-redundant database. 



The solvent accessibility of the flanking sequences are 
also studied in the present work. Figure 7 shows the 
fraction of flanking sequences at different average normal- 
ized solvent accessibility. Despite close similarity in their 
amino acid preferences, these flanking sequences have 
vast differences in solvent accessibility. The sequences 
flanking the non-helical conformations at both N- and 
C-terminus are less solvent accessible compared to those 
flanking the wild type helices. The significance of the 
result is validated by the t-test, which gives a P-value of 
P < 0.001 and T-value T = 20.3459. Similar differences 
in the solvent accessibility of the flanking sequences were 
observed in earlier studies for chameleon/ambivalent 
sequences [25,28,30,34]. 

Conclusions 

Helices from the non-redundant database are point 
mutated to yield mutated sequences which are mapped 
in the SCOP database to obtain identical sequences 
with non-helical conformations. The results of this study 
confirm that small helices are more prone to non- 
helical conformations on point mutations. Mutations with 
helix breaking residues mostly transform the mutated 
sequence to non-helical conformations, while mutations 
of helix indifferent residues hardly yield non-helical 
conformations. 

Solvent accessibility of mutating/ mutated residues can 
be a primary factor for the helices to conform to non- 
helical conformations following point mutations. Most 



mutated residues are found to differ in the degree of 
solvent accessibility in helical and non-helical confor- 
mations. The non-helical conformations obtained after 
point mutations are found to be less solvent accessible as 
compared to the original helical conformations. Minimal 
differences appear in the amino acid distributions for the 
flanking sequences of helices in non-redundant database 
and their corresponding non-helical conformations in the 



□ Helical 



Non helical 




Average normalized solvent accessibility (%) 

Figure 7 Average solvent accessibility of flanking sequences. 

Fraction of (A) N-terminus and (B) C-terminus flanking sequences of 
helices and their mutated sequences in the non-helical 
conformations with different average normalized solvent accessibility. 
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SCOP database. However, the solvent accessibility of the 
flanking sequences of helices are vastly different from 
those of the respective non-helical conformations which 
are buried in the protein interior. 
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